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ABSTRACT 

Non-long terminal repeat (non-LTR) retrotrans- 
posons are highly abundant elements that are 
present in chromosomes throughout the eukaryotic 
domain of life. The long interspersed nuclear 
element (LINE-1) (L1) clade of non-LTR retrotrans- 
posons has been particularly successful in 
mammals, accounting for 30-40% of human 
genome sequence. The current model of LINE 
retrotransposition, target-primed reverse transcrip- 
tion, culminates in a chromosomally integrated end 
product. Using a budding yeast model of non-LTR 
retrotransposition, we show that in addition 
to producing these 'classical', chromosomally 
integrated products, a fungal L1 clade member 
(Zorro3) can generate abundant, RNA-derived 
episomal products. Genetic evidence suggests 
that these products are likely to be formed via a 
variation of target-primed reverse transcription. 
These episomal products are a previously unseen 
alternative fate of LINE retrotransposition, and may 
represent an unexpected source for de novo 
retrotransposition. 

INTRODUCTION 

Non-long terminal repeat (non-LTR) retrotransposons 
are highly abundant elements that are present in chromo- 
somes throughout the eukaryotic domain of life. Members 
of the LI clade of non-LTR retrotransposons are of par- 
ticular interest, as they have amplified to high copy 
number in mammahan genomes. Sequencing projects 
have shown that 15-30% of mammahan genomes com- 
prise long interspersed nuclear element (LINE) sequence 
(1-8). An additional 8-20% of mammalian sequence is 
derived from non-LINE sequences (e.g. Alu elements, 
SVA elements and processed pseudogenes) that parasitize 
LINE machinery to retrotranspose (9). The process of LI 
retrotransposition is still occurring in humans today and is 



a source of DNA damage (10,11), de novo mutations 
(12,13) and genetic variabiHty (14-20). 

Most available information on the biology of LI clade 
members is derived from experiments with the mammalian 
LI elements. An LI element produces a bicistronic tran- 
script encoding two proteins, ORFl and ORF2 
(Figure lA). ORFl protein binds to single-stranded 
nucleic acids with high affinity and possesses nucleic acid 
chaperone activity (21-25). ORF2 encodes endonuclease 
(26) and reverse transcriptase activities (27-30). All of 
these activities are important for retrotransposition 
(26,31-33). Complete mechanistic details of how LINE 
elements retrotranspose are unclear; however, a model of 
the initial steps of reverse transcription, called target- 
primed reverse transcription (TPRT), has emerged from 
biochemical experiments with the related R2 element of 
Bombyx mori (34). Later experiments with human LI 
support this TPRT model (35), in which the 
LI ribonucleoprotein particle (RNP), consisting of LI 
RNA, ORFl and ORF2, enters the nucleus and nicks a 
chromosomal target site. Reverse transcription ensues 
using LI RNA as the template and the free y end at the 
chromosomal nick as the primer (Figure lA). Thus, LI 
cDNA synthesis and chromosomal attachment is concur- 
rent (in contrast to retroviral and LTR-retrotransposon 
cDNA synthesis). Completion of the insertion involves a 
second nick on the opposite DNA strand of the target site, 
which serves as the primer for plus strand DNA synthesis. 
Consistent with this mechanism, all characterized 
products of authentic LI retrotransposition to date have 
been chromosomally integrated (26,33,36-40). 

The Candida albicans genome contains an active yeast 
non-LTR retrotransposon called Zorro3 (41,42). 
Phylogenetic analysis of the Zorro3 reverse transcriptase 
domain places Zorro3 in the LI clade (41). Genomic 
Zorro3 elements tend to be flanked by polyadenosine 
[poly (A)] tracts (Figure lA). The poly (A) flanks are 
likely derived from three sources — the target site [Zorro3 
tends to integrate into poly (A) tracts], a possible target site 
dupHcation [if both endonuclease nicks are within the 
target site poly (A) tract], and the mRNA poly (A) tail. 
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Figure 1. A novel class of LINE retrotransposition products. (A) Structure of LI clade members and proposed mechanism of non-LTR retrotrans- 
poson replication. Top, comparison of the structures of human LI and Candida albicans Zorro3. Both elements contain two ORFs and endonuclease 
(endo), reverse transcriptase (RT) and zinc finger (zf) domains. Genomic Lis are typically flanked by target site duphcations (flanking arrows). 
Genomic Zorro3s are flanked by poly(A) tracts, which may include poly (A) target site duphcations. Bottom, general mechanism for the initiation of 
non-LTR retrotransposon rephcation. (B) Assay to induce Zorro3 retrotransposition. A donor Zorro3 marked with a HIS3AI (47) retrotransposition 
cassette (Zorro3mHIS3AI) is present at the LYS2 locus of chromosome IL To induce retrotransposition, cells are grown on galactose and then plated 
on SC-His to select retrotransposition events. H = HaelL Bar below Zorro3 represents the probe used for Southern blots in all subsequent figures. 
(C) Example of LM-PCR products. Each lane represents the LM-PCR product from an independent Zorro3 retrotransposition event. Owing to 
variable EcoRI distances from the Zorro3 3' end. Class A LM-PCR products are not always amplified. 'mHIS3AI only' is an LM-PCR reaction from 
a strain containing only the mHIS3AI cassette. 'Donor only' is an LM-PCR reaction from a strain containing Zorro3mHIS3AI, but not a 
retrotransposition event. Class A products are noted with an asterisk. (D) Southern analysis of class A and class B events. Genomic DNA from 
the indicated clones was digested with Haell and probed for Zorro3 sequence. Arrow indicates the band representing the original Zorro3mHIS3AI 
donor. * = class A insertions, n = nicked. 1 = linear, s = supercoiled. 



As these sequences are identical homopolymers, we cannot 
unambiguously determine the source for all of the nucleo- 
sides in the poly (A) tracts. Another distinguishing feature 
between Zorro3 and mammahan LI elements is that the 
former contains a type I ORFl, whereas the latter 
contains a type II ORFl (43). Type I ORFls contain 
CCHC zinc knuckle motifs instead of the conserved mam- 
malian C-terminal domain, and are found in at least 5 
different non-LTR retrotransposon clades (including 
non-mammalian members of the LI clade), indicating 
their ancient origin (43). We previously redesigned and 
synthesized an Saccharomyces cerevisiae-com^^i\h\Q 
version of Zorro3 (40). We found that Zorro3 is able to 
retrotranspose in S. cerevisiae and maintains the require- 
ment for ORFl and ORF2 motifs. In addition, 
chromosomally integrated Zorro3 retrotransposition 
events revealed structural similarities to LI integrations 
in human and mouse, suggesting that Zorro3 retrotran- 
sposition in S. cerevisiae is mechanistically similar to 
mammalian LI retrotransposition. Here, we describe a 
previously unknown second class (class B) of 



retrotransposition products from Zorro3. Rather than 
being chromosomally integrated, class B products are 
RNA-derived DNA circles that represent the majority of 
retrotransposition products formed by Zorro3. This is the 
first time LINE elements have been shown to produce 
extrachromosomal products from the retrotransposition 
reaction. These episomal products may be a previously 
unrecognized side product or an intermediary source of 
active LINE retrotransposition. 

MATERIALS AND METHODS 

Yeast strains 

The yeast strain used in Figures 1 through 3 (JHY 339) 
has been already described (40). The Zorro3pA~ strain, 
JHY343, was generated by converting wt Zorro3 to 
Zorro3pA~ by two-step gene replacement using 
pRS406-derived plasmids, as previously described (40). 
To test nucleic acid enzyme mutants for effects on class 
B frequency, we used the strain JHY869 to knock out 
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compete open reading frames of desired genes by 
PCR-mediated disruption (44). JHY869 contains the 
Zorro3mHIS3AI and His3A200 alleles backcrossed four 
times to YAD373 (45), to generate a retrotransposition 
strain with an s288c background with high sporulation 
frequency. Our laboratory has since switched to this 
strain for all Zorro3 retrotransposition studies because it 
is isogenic with the sequenced yeast genome and the yeast 
knockout collection, and it contains three alleles 
[RMEl(ins-308A), TA03(E1493Q) and MKT1(D30G)] 
that allow it to sporulate with high efficiency for easier 
genetic manipulation. All yeast strains and genotypes are 
listed in Supplementary Table SI. 

Primers 

All primers in this study are Hsted in Supplementary 
Table S2. 

Southern analysis 

Genomic DNA was prepared as described previously (46). 
One microgram of genomic DNA was digested with the 
appropriate restriction enzyme, run on a 0.8% agarose gel, 
and blotted in 0.4 M NaOH to positively-charged nylon 
membrane (Millipore). Hybridizations were performed 
in Ultrahyb (Ambion) at 42° C. Riboprobes were 
biotin-16-UTP labelled T7 in vitro transcription products 
of JH328/JH329 PGR products (Zorro3mHIS3 probe) or 
CD15/CD16 PGR products (tubulin probe). Detection 
was performed with the Phototope-star detection kit 
(New England Biolabs). 

Class B product stability assays 

A His^ colony from a SG-His plate was patched onto a 
yeast extract peptone dextrose (YPD) plate and incubated 
at 30°G for 24 hours. Gells from this YPD plate were used 
to inoculate a 2mL YPD culture and incubated at 30°G 
for 24 h. Serial dilutions of this culture were plated on 
YPD plates, and colonies were allowed to form for 2-3 
days at 30°G. Plates with ~ 100-300 colonies were replica 
plated to both SG-His plates and new YPD plates. 
Percentage stability is defined as the number of colonies 
that grow on the SG-His plate divided by the number of 
colonies that grow on the YPD plate. 

Zorro3 retrotransposition assays and colony PCR 

Quantitative Zorro3 retrotransposition assays were per- 
formed essentially as described previously (40). When 
isolating retrotransposition events to assay for class B 
products, induction of retrotransposition was performed 
by patching strains on SG+galactose plates (instead of 
cultures) to ensure that each His^ colony was an inde- 
pendent event. The induction plates were replica plated 
to SG-His to select for His^ events. Golony PGR (cPGR) 
was performed by inoculating 2.5 |iL of H2O. This mixture 
was subjected to two freeze-thaw cycles, then used in a 
20|LiL PGR reaction containing 500 nM each primer, 
200 nM dNTPs and 0.25 units ExTaq (Takara). JH774/ 
JH775 amplifies class B-I, and JH774/JH776 amplifies 
class B-II. Gonditions were 94° G for 4 min, 42 cycles of 



94°G for 15 s, 56°G for 15 s, 72°G for 1 min, followed by a 
2-min 72° G extension. Glass B events were cloned into a 
standard TA vector for sequencing. 

RESULTS 

An unusual class of Zorro3 retrotransposition products 

We used a previously described assay (Figure IB) to detect 
Zorro3 retrotransposition events in budding yeast as His^ 
cells (40). This assay depends on the spHcing of a synthetic 
intron out of a reverse reporter (47) placed in the ZorroS 
3^ UTR. A functional HIS3 reporter is made only after 
Zorro3 transcription, splicing and reverse transcription/ 
integration. Zorro3 induction produced cells that formed 
colonies of heterogeneous sizes on SG-His plates, with a 
large proportion of small colonies. Ligation-mediated 
PGR (LM-PGR) of 3^ flanking sequence from independ- 
ently derived His^ clones revealed two main classes of 
products (schematic in Supplementary Figure SI). The 
first class was typically present in the biggest colonies 
and generated LM-PGR products of various sizes (e.g. 
Figure IG, lanes 1, 5, 7, 9). This class represents 
chromosomally integrated retrotransposition events, 
which are described in detail elsewhere (40). We will 
henceforth refer to these classic, chromosomally 
integrated products as 'class A' products. The second 
class of retrotransposition events was present in some of 
the large colonies and almost all of the small colonies, and 
gave rise to LM-PGR products with characteristic band 
sizes of either ~400 base pairs or ^1000 base pairs 
(Figure IG, lanes 13-18, 20). Sequencing revealed that 
the 3^ flanking sequence in these PGR products was 
either Zorro3 5'UTR/ORFl or Zorro3 interORF/ORF2, 
respectively (Supplementary Figure SI). We classified 
these retrotransposition events as 'class B', and further 
distinguished them as 'class B-F (3^ flanked by ORFl) or 
'class B-ir (3^ flanked by ORF2). 

Zorro3 has a preference to integrate into chromosomal 
poly(A) tracts (40,42). As the donor Zorro3 on chromo- 
some II contains poly(A) tracts at the 5^ and interORF 
regions (Figure IB), we hypothesized that class B products 
were simply integrants at these locations. To test this hy- 
pothesis, we performed Southern analysis on 
Haell-digested genomic DNA from strains containing 
class A, B-I and B-II products (Figure ID), using 
Zorro3mHIS3 sequence as a probe. As expected, class A 
products (clones 4 and 7) were integrated at a new site, 
generating a band distinct from the original donor band. 
We predicted that if class B products were integrated into 
the donor Zorro3, the original donor band would increase 
in size. However, the donor band was unaltered in class B 
strains, indicating that class B products are not integrated 
into the donor locus. Instead, class B products appeared 
as three new bands in addition to the donor band 
(Figure ID, clones 13-17). Within each subclass (B-I or 
B-II), the banding pattern was virtually identical. Because 
the His^ clones used for Southern analysis were produced 
independently, this suggests that class B-I and B-II 
products are specific structures that are reproducibly 
generated on induction of Zorro3 retrotransposition. 
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YPD SC-his percent 
clone # colonies colonies stability 
4 (class A) 327 327 100 

7 (class A) 306 306 100 

13 (class B) 355 10 3 

14 (class B) 252 16 6 

15 (class B) 318 17 5 

16 (class B) 208 4 2 

Figure 2. Instability and transfer of class B products between strains. (A) Representative examples of class A and class B product stability. 
(B) Quantitation of the experiment described in panel A. (C) Transfer of class B product between strains. Total genomic DNA purified from 
strains containing a class A or class B insertion was used to transform His" yeast cells without pre-existing Zorro3 sequence. Transformations were 
plated on SC-His. His^ colonies represent successful maintenance of the retrotransposition product under selection. (D) Class B transformants 
harbour the class B episome. Genomic DNA from His^ transformants described in panel C was digested with Haell and subjected to Southern 
analysis. The donor (arrow) and class B products are both present in the original class B strains (13-16), but only the class B product is transferred 
to the transformants (13-1, 13-2, etc.). (E) Class B products transferred to new strains exhibit instabihty. His^ stabihty testing described in panels A 
and B was repeated with class B transformants generated in panel C. 



YPD SC-his percent 
clone # colonies colonies stability 



13-1 


109 


3 


3 


13-2 


100 


3 


3 


14-1 


111 


4 


4 


14-2 


141 


3 


2 


15-1 


97 


3 


3 


15-2 


101 


4 


4 


16-1 


145 


4 


3 



16-2 147 1 1 



The three bands representing class B products bear a 
striking resemblance to the three forms of DNA (nicked, 
linear and supercoiled) often seen in plasmid preparations, 
and the intensity of the bands relative to the donor band 
suggests they are present in multiple copies. Thus, we next 
sought to investigate whether class B products were DNA 
circles. 

Class B products are DNA circles 

As class A products are integrated into a host chromo- 
some, they are expected to be faithfully replicated and 
partitioned to daughter cells along with the rest of the 
chromosome. Episomes, on the other hand, may 



eventually be lost from a dividing population, unless 
they contain a centromere or other partitioning system. 
To test whether class B products are stably transmitted 
to daughter cells, we grew class B, His^ clones under 
non-selective conditions for 2 days. The resultant popula- 
tion quickly became predominantly His" (Figure 2A and 
B). In contrast, class A clones remained 100% His^ after 
non-selective growth. These results are consistent with hy- 
pothesis that class B products are episomal circles. 

We next tested the transformation ability of class B 
products. We transformed total genomic DNA prepar- 
ations from class A or class B clones into a Zorro3~, 
His" strain. We reasoned that for class A genomic DNA 
to convert this parental strain to His^, a fragment of 
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Figure 3. Structural characterization of class B products. (A) Predicted structures and restriction fragment sizes of class B-I and class B-II circles. 
(B) Verification of predicted class B structures. Genomic DNA from donorless strains containing a class B circle were digested with the indicated 
enzyme and subjected to Southern analysis with the Zorro3mHIS3 probe shown in Figure IB. 



transformed DNA containing the spliced HISS gene 
would need to integrate into the host chromosome via 
homologous recombination or non-homologous end 
joining. Given that DNA fragments containing the HI S3 
gene are expected to be a small fraction of the total nucleic 
acid in a genomic DNA prep, transformation to His^ by 
class A DNA is a rare event (Figure 2C, left). However, 
transformation with the same amount of class B DNA led 
to comparatively high number of His^ colonies 
(Figure 2C, middle and right). This suggests that class B 
products are in the more easily transformable circu- 
lar form. To confirm this, we isolated total 
genomic DNA from 'recipient' strains (e.g. 13-1, 13-2, 
etc.) newly transformed with class B DNA and probed 
for Zorro3 sequence (Figure 2D). The respective 
class B products were present in the recipient strains, 
but the chromosomally integrated donor from the ori- 
ginal strains was not transferred (arrow in Figure 2D). 
The His^ phenotype exhibited the same instabil- 
ity shown in original class B strains (Figure 2E). 
These data are consistent with class B products being 
plasmids. 

As Zorro3 has three polyadenosine [poly(A)] tracts cor- 
responding to the locations where class B circles are 
'joined', we inferred the general structure of class B-I 
and class B-II forms (Figure 3A). We then confirmed 
these structures by restriction digests of genomic DNA 
from strains containing a class B-I or class B-II product 
(but not the original integrated donor) and probing for 



Zorro3 (Figure 3B). In all cases, we obtained the predicted 
restriction pattern, demonstrating unambiguously that 
non-LTR retrotransposition can, in some cases, result in 
circular products. The circular structure of these products 
also explains the size of the bands obtained from 
LM-PCR (shown in Supplementary Figure S2 schematic). 

Class B products can form with high frequency 

To simplify analysis of large numbers of retrotran- 
sposition events, we used a colony PCR (cPCR assay). 
Primers flanking the class B-I or class B-II junctions 
(primers 1 and 2 or primers 1 and 3 in Figure 4A) specif- 
ically detected the presence or absence of class B circles 
(Figure 4A-C). We used the cPCR assay to measure class 
B product frequencies. We induced 44 independent 
retrotransposition events. Remarkably, 24/44 clones con- 
tained a class B-I product, and 8/44 clones contained a 
class B-II product (Figure 4D, top). Thus class B 
retrotransposition can be a frequent event. As the forma- 
tion of circles is possibly augmented by a template switch 
between the poly (A) tracts of the Zorro3 mRNA and the 
A-rich top strand of the Zorro3 target site, we investigated 
whether deleting the 5^ poly(A) tract of Zorro3 
(Zorro3pA~) eliminated class B-I products. We found 
that 12/44 retrotransposition clones derived from 
Zorro3pA~ contained a class B-I event, suggesting that 
the long stretch of homology [in this case, poly(A)] is 
not required for circularization (Figure 4D, bottom). 
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Figure 4. Class B products form with high frequency. (A) A cPCR assay to detect class B products. The indicated primers (in red) were used for 
colony PCR reactions to specifically ampHfy products in strains containing class B-I or class B-II circles. (B) The cPCR assay is sensitive and specific 
for class B products. Original class B strains and strains converted to His^ by class B genomic DNA transformation (Figure 2C and D) were 
subjected to the cPCR assay. (C) His^ phenotype tracks with the class B circle. A His^ stabihty experiment (described in Figure 2E) was performed 
on parent strains transformed with a class B product. Colonies that lost the His^ phenotype (YPD, His") and rare colonies that retained the 
phenotype (YPD, His^) were tested by the cPCR assay for the presence of a class B episome. (D) Class B products constitute the majority of Zorro3 
retrotransposition events. For two strains (containing Zorro3mHIS3AI or Zorro3pA~mHIS3AI), 44 independently derived retrotransposition events 
(ah unlabelled lanes) were tested by cPCR for the presence of class B-I or class B-II products. 13 = an original class B-I clone (control). 14 = an 
original class B-II clone (control). 



Pre-existing homology is not required for class B 
product formation 

To learn more about the nature of the junctions of class B 
circles (junction defined as the region corresponding to 
where the 5^ and 3^ ends of the retrotransposed element 
are joined to form the circle), we cloned and sequenced 
the junctions of five independent class B-I and class B-II 
clones derived from either wild-type Zorro3 or Zorro3pA~ 
(Figure 5A-E). Sequencing revealed that 3/5 wild-type 
class B-I clones (#1, 2, 5 from Figure 5B) comprised 
full-length Zorro3 sequence with the 3^ and 5^ ends joined 
by a poly (A) tract. The only difference between these clones 



was the length of the joining poly(A) tract. A fourth clone 
(#4) was nearly identical, but missing two Gs at the 3^ end 
before the poly(A) junction. This could be due to an alter- 
native cleavage and polyadenylation site, or internal 
priming of Zorro3 mRNA during the initial step of 
reverse transcription (48). The remaining clone (#3) con- 
tained 113 base pairs of sequence upstream of the 5^ 
poly(A) tract, preceded by an untemplated G, suggesting 
that this clone was derived from an mRNA that initiated 
transcription upstream of the 5^ poly(A) tract. 

The junctions of class B-II products derived from 
Zorro3 and Zorro3pA~ were essentially the same, with 



10872 Nucleic Acids Research, 2012, Vol. 40, No. 21 



0RF1 



|-A19^ 



GAL1 promoter 



5' poly(A) region: 



0RF2 



|S8| [TRI A20 



0RF1 



TATTACTTCTTATTCCTCTACCGCCTCGAGAAAAAAAAAAAAAAAAAAAATTATTATGACTACGAACGGTACTAATGGTT 



interORF region 



interORF region 



interORF poly(A) region : tactaaaacaaaaactgataaaaacactagaaaaaaaaaaaaaaaaaaagagaaagacaacaaattcatattcatacat 



3' poly(A) region: 



3' UTR 



ATATAAG G ACTAAATATATGTTG AG AAAG G AAAAAAAAAAAAAAAAAAAAG G ATCCCCCTATTTATTTTTTTATAGTTAT 



B 

wt ZorroS class B-l clones: 

1 ) ...TATGTTGAGAAAGGA33TTATTATGACTA... 

2) . . TATGTTG AG AAAGG AggTTATTATG ACTA. . . 

3) ...TATGTTGAGAAAGGA24GGATGATTTTT -- 93 bp ■ ■ CCGCGTCGAGA2qTTATTATGACTA.. 

4) . . TATGTTG AG A4gTTATTATG ACTA. . . 

5) . . .TATGTTG AG AAAGGA^^TTATTATG ACTA. . . 

D Zorro3pA~ class B-l clones: 

1 ) ...TATGTTGAGAAAGGA25TAGAACGGTACTAA... 

2) ...TATGTTG AG AAAG GA25GATAACCACTT - 49 bp ■■ CCGCCTCGAGTTATTATGACTA... 

3) ...TATGTTGAGAAAGGA.0TCTGCTTGTTGCTTCTTCTTTTAAGATAGGATCATCCGC... 

35S rRNA 5' ETS 

4) ... TATGTTG AG AAAGGAggTAACCACTTT - 48 bp ■ ■ CCGCCTCGAGTTATTATGACTA... 



5) ...TATGTTGAGAAAGGA^ ^TTAGATGGTGGTC, 




Wt Zorro3 class B-ll clones: 

1 ) ...TATGTTGAGAAAGGAggGAGAAAGACAAC... 

2) . . .TATGTTG AG AAAGGA2qG AG AAAG ACAAC. . . 

3) . . .TATGTTG AG AAAGGA2gG AG AAAG ACAAC. . . 

4) . . .TATGTTG AG AAAGGA43G AG AAAG ACAAC. . . 

5) ...TATGTTGAGAAAGGA^ ^G AG AAAG ACAAC... 

E Zorro3pA~ class B-ll clones: 

1 ) ...TATGTTGAGAAAGGA^4GAGAAAGACAAC... 

2) ...TATGTTGAGAAAGGA^ qqGAGAAAGACAAC. 

3) . . .TATGTTG AG AAAGGA25G AG AAAG ACAAC. . . 

4) . . .TATGTTG AG AAAG GA^ gG AG AAAG ACAAC. . 

5) . . .TATGTTG AG A2qG AG AAAG ACAAC. . . 




Figure 5. Pre-existing homology is not required for class B product formation. (A) Structure of Zorro3 donor and view of Zorro3 regions where 
most class B junctions occur. Class B-I circles typically form from a joining between the 3' poly(A) region and the 5' poly(A) region, whereas class 
B-II circles typically form from a joining between the 3' poly(A) region and the interORF poly(A) region. Zorro3pA~ sequences are identical to 
Zorro3 except the stretch of 20 (A)s in the 5' poly(A) region is deleted. (B-E) Sequences of cloned class B product junctions derived from wild-type 
Zorro3 or Zorro3pA~ donors. Untemplated nucleotides are indicated in green. Yeast genome sequences are indicated in brown. 



variations in poly(A) length being the only difference 
between clones (Figure 5C and E). One clone 
(Zorro3pA~ #5) was missing two Gs at the y end, but 
otherwise identical to the rest. The similarities between 
Zorro3 and Zorro3pA~ class B-II clones were expected, 
as no modifications were made to the interORF poly(A) 
tract in Zorro3pA~. 



The aforementioned data suggests that, in most cases, 
stretches of homology [poly(A) tracts] between the 5^and 
y ends of the element serve as the preferred connecting 
junction for class B circles. As other notable LINE 
elements (e.g. human and mouse LI) do not contain 
pre-existing homology between the 5^ and y ends, we 
analysed Zorro3pA~ class B-I products to ask whether 



Nucleic Acids Research, 2012, Vol 40, No. 21 10873 



Genotype 


Class B-I 
(%) 


Class B-II 
(%) 


Total Class B 
(%) 


wt 


48 


24 


72 


radlA 


20 


19 


39 ** 


radlOA 


40 


15 


55 * 


mms4A 


44 


24 


68 


musSlA 


46 


32 


78 


slxlA 


32 


31 


63 


slx4A 


44 


21 


65 


rnhlA 


10 


61 ** 


71 


rnhlOlA 


41 


26 


67 


radlA/slx4A 


28 


11 


39 ** 


rnhlA/rnhlOlA 


22 


47 ** 


69 



B 




rad1A rnh1A 



Figure 6. Mutations affecting class B frequency and ratio. (A) Zorro3 was induced for retrotransposition in the indicated mutants. In all, 94 
independently derived clones were picked for each mutant and assayed by cPCR for the presence of Class B products. **P< 0.001. *P<0.05. 
(B) Loss of RNHl leads to an increase in total Zorro3 retrotransposition. A quantitative retrotransposition assay was performed on the indicated 
strains. Frequency is presented as the average of five independent cultures. Black bars indicate the range of frequencies obtained. 



the pre-existing 5^ and y homology in Zorro3 was an 
absolute requirement for class B circle formation. When 
the 5^ poly(A) tract was deleted, class B-I products still 
formed, but with greater heterogeneity and non-templated 
nucleotides at the junction (Figure 5D). Non-templated 
nucleotides are similarly seen at mammalian LI 5^ junc- 
tions (36,37). It has been proposed that these 
non-templated nucleotides are added onto the y end of 
the minus strand by reverse transcriptase, facilitating 
microhomology-mediated template switching (49). Two 
of the truncation points (clones #2, 4) are upstream of 
the beginning of the element [where the 5' poly (A) tract 
would normally be]. This upstream sequence may repre- 
sent transcription initiating upstream of the Zorro3 start. 
Two of the truncation points (clones #1, 5) are down- 
stream in the element. The final clone 5^ truncates down- 
stream in Zorro3 sequence, but template switches to 35S 
rRNA sequence before template switching to Zorro3 y 
sequence. In a prior analysis of class A clones, we saw 
similar cases of template switching to endogenous 
cellular sequences (40). There is no appreciable 
pre-existing homology at the junction in any of the 
Zorro3pA~ class B-I clones. In 3/5 cases, there were 
untemplated bases at the junction point before the 
template switch. This demonstrates that pre-existing 
element junction homology is not a requirement for the 
formation of class B products. 

Mutations affecting class B frequency and ratio 

Zorro3 retrotransposition events (which, as shown earlier, 
are predominantly class B products) are dependent on 
conserved residues in ORFl, ORF2 endonuclease and 
ORF2 reverse transcriptase (40). Thus, it is likely that 
class B products are formed via a variation of TPRT. 
The endonuclease requirement, in particular, suggests 
that for class B circles, the reaction requires nicked 
chromosomal DNA as the initiating primer for reverse 
transcription. This would mean that even retrotran- 
sposition intermediates destined to become circles would 
be initially attached to a chromosome. 

Based on this assumption, we explored how a non-LTR 
retrotransposition intermediate could be excised from a 



chromosome. TPRT predicts the formation of y flaps 
on the top and bottom strands of a retrotransposition 
intermediate (see Figure lA, bottom for an example of 
the bottom strand flap). Cleavage of both of these flaps 
by a cellular endonuclease(s) could lead to excision of the 
LI intermediate. Previous work has shown that the XPF/ 
ERCCl, a y flap endonuclease involved in nucleotide 
excision repair and homologous recombination, can 
restrict human LI retrotransposition (50). This suggests 
a potential interaction between LI intermediates and y 
flap endonucleases. 

To test whether the loss of y flap endonucleases in yeast 
effects the formation of class B products, we knocked-out 
various genes involved in recognition and cleavage of 
branched nucleic acid structures. Radl/RadlO is the 
yeast ortholog of the mammahan XPF/ERCCl 
heterodimer mentioned earlier. Mus8 1 /Mms4 is an endo- 
nuclease involved in HoHday junction resolution with a 
preference for y flap substrates (51,52). Slx4 is thought 
to recognize DNA structures and serves as a scaffold to 
recruit endonucleases, such as Radl/RadlO, Mus81/ 
Mms4 and Slxl (a 5^ flap endonuclease) (53). In each of 
these mutant strains, we induced Zorro3 retrotran- 
sposition and assayed the percentage and ratio of class B 
events (Figure 6A). The most striking effects on formation 
of class B products were found in radlA and radlOA 
strains, which showed a significant decrease in total per- 
centage of class B products (P = 9 x 10~^ and P = 0.022, 
respectively; Fisher's exact test). Although Slx4 can form 
a complex with Radl/RadlO, deletion of SLX4 did not 
lead to dramatic changes in class B frequency. 
In addition, a radlA/slx4A double mutant showed a 
similar phenotype as the radlA single mutant. Together, 
these data suggest that Radl/RadlO can act inde- 
pendently of Slx4. Although the percentage of class B 
product formation decreased in radlA mutants, overall 
levels of retrotransposition remained largely unchanged 
(Figure 6B). Our data suggest that Radl/RadlO may 
play a role flap in cleavage during class B circle release 
from the chromosome. 

Although LINE reverse transcription is usually initiated 
by endonuclease mediated nicks, in some cases, DNA 
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breaks formed by other means can serve as a primer for 
reverse transcription (54,55). We sought to genetically 
induce DNA breaks by deleting RNaseH. rnhlA strains 
are expected to lead to a relative persistence of RNA/ 
DNA hybrids. Multiple studies suggest that persistent 
RNA/DNA hybrids lead to DNA breaks (56-59). We pre- 
dicted that an environment with preexisting DNA breaks 
would provide more potential chromosomal primers for 
LINE minus and/or plus strand synthesis. In the case of 
ZorroS, which has a low rate of retrotransposition 
(~1 X 10~^ under the conditions of our assay), this may 
lead to an increase in retrotransposition frequency. 
Indeed, we found that in an rnhlA strain, overall Zorro3 
retrotransposition frequency increases by greater 
than 10-fold (Figure 6B). Importantly, class B products 
still comprised ~70% of these rnhlA retrotransposition 
events, indicating that class A and class B products 
are equally increased by introducing more potential 
chromosomal primers. This further strengthens our hy- 
pothesis that class B products are initiated at the 
chromosome. 

At some point during TPRT, cleavage of the top strand 
must occur. The newly synthesized minus strand can then 
jump templates from retrotransposon RNA to the newly 
cleaved top strand. We hypothesized that in an rnhlA 
strain, the abundant availability of chromosomal breaks 
to serve as a primer (especially if a break is made at the 
target site) could allow premature minus strand template 
jumping and plus strand synthesis. EarHer minus strand 
template jumping [e.g. from the interORF poly(A) tract 
instead of the 5^ UTR poly(A) tract] would be expected to 
give rise to a greater proportion of class B-II products. 
Consistent with this hypothesis, in rnhlA strains, we 
observe a dramatic shift in the ratio of class B-I to class 
B-II products. Wild- type class B products are 33% class 
B-II, whereas rnhlA class B products are 85% class B-II 
(P = 3 X 10~^^; Fisher's exact test). Again, these results 
are consistent with minus and plus strand synthesis 
being initiated from the chromosome. 



DISCUSSION 

We have shown that in addition to the expected integrated 
retrotransposition events, a LINE element can also 
generate episomal retrotransposition events. Previously, 
circular products derived from retroviruses/LTR- 
retrotransposons have been observed (60-62). However, 
retroviruses/LTR-retrotransposons replicate in a funda- 
mentally different manner when compared with 
non-LTR retrotransposons. Retroviruses/LTR- 
retrotransposons complete Hnear cDNA synthesis in the 
cytoplasm. The formation of retroviral/LTR- 
retrotransposon circles is postulated to occur by various 
mechanisms, including homologous recombination 
between the 5^ and y LTRs, non-homologous end 
joining of the linear cDNA ends, errors in reverse tran- 
scription and auto-integration (63-69). As these mechan- 
isms involve retroviral intermediates, they seem unlikely to 
explain the formation of LINE circles. A less-specific class 
of circular DNA, small dispersed circular DNA 



(spcDNA), has been found to be present in essentially 
all mammahan cells (70,71). spcDNA consists of hetero- 
geneous circles of sizes from hundreds to thousands of 
base pairs, and spcDNA is increased in cells with 
genome instability (70,71). spcDNA appears to be 
derived from chromosomal DNA and contains unique 
sequence, non-coding sequence and repetitive elements 
(including Alus and Lis). As short interspersed nuclear 
elements (SINEs) and LINEs are among the most 
abundant sequences in the genome, it is not surprising 
that circularized chromosomal fragments would contain 
these elements owing to their sheer percentage of 
genome mass. There is a variety of proposed mechanisms 
for the formation of spcDNA (72), the most Hkely being 
'pop-out' of existing genomic DNA. Thus, the work 
described here is the first example of LINE circle forma- 
tion via bona fide retrotransposition. 

How are class B products formed? As mentioned earher, 
class B products are normally endo-dependent. This 
suggests that chromosomal nicks are used as primers for 
reverse transcription, and thus class B products initiate 
attached to a chromosome. The requirement of ORFl, 
ORF2 endo and ORF2 reverse transcriptase strongly 
suggests that class B products are formed by TPRT, 
with a variation in retrotransposition intermediate reso- 
lution. The massive increase in both class A and class B 
retrotransposition in rnhlA strains further supports a 
TPRT-based model for class B formation, as rnhlA 
strains are expected to contain excess DNA breaks, a sub- 
strate for TPRT (54-59). Based on our data, we present a 
speculative model for circle formation in Figure 7. The 
steps in Figure 7A-E are identical to a standard TPRT 
reaction. We speculate that a structure-specific endonucle- 
ase can cleave the top and bottom strands in the vicinity of 
the y flaps. As deletion of the Radl/RadlO y flap endo- 
nuclease significantly reduces class B product formation, 
Radl/RadlO may play a role in such a cleavage. However, 
Radl/RadlO cannot be solely responsible for this step, as 
loss of Radl/RadlO does not lead to the complete elimin- 
ation of class B products. It is possible that other cellular 
nucleases play a redundant role, or even that the Zorro3 
endonuclease may perform the recleavages. Further work 
is needed to distinguish between these possibiHties. Once 
cleaved from the chromosome, the ends of the predicted 
linear excision product may contain genomic DNA cor- 
responding to the genomic target site (highlighted in green 
in Figure 7F and G). These ends could anneal and be 
repaired by DNA repair pathways to form a circle. As 
Zorro3 targets poly(A) tracts, the highUghted green 
sequence in Figure 7F and G is likely to be poly(A), 
giving the appearance of a pure poly(A) junction. If this 
model were extrapolated to other LINE elements, the 
highUghted green segment of DNA would correspond to 
part of the target site duplication, and the size would be 
dependent on the distance between the top and bottom 
strand recleavages. 

An obvious alternative to the model in Figure 7 is the 
'pop-out' of stably integrated Zorro3 retrotransposition 
products owing to homologous recombination between 
the flanking poly (A) tracts. Several Hnes of evidence 
argue against this possibiHty. First, class B products 
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Figure 7. A speculative model for the formation of class B products. (A) Genomic target site. (B) Bottom chromosome strand nick and LINE 
mRNA anneahng. (C) Minus strand synthesis. (D-E) Top strand chromosomal nick and template jump to top strand. (F) Recleavage of top and 
bottom strands (at arrowheads) to release the retrotransposition intermediate. The timing and placement of recleavages do not necessarily need to 
occur as indicated in the figure. Chromosomal target site DNA between the sites of recleavage have been highhghted in green for emphasis. As 
Zorro3 integrates into poly(A) tracts, the DNA highhghted in green would hkely be poly(A) sequence. For other LINE elements, the DNA in green 
would Hkely correspond to part of the target site duphcation. (G) Resolution of the excised circle by anneahng of complementary strands. Red fines 
indicate RNA. Blue lines indicate newly synthesized retrotransposon DNA. 



were initially observed in rad52A strains, which are defi- 
cient in homologous recombination (Figures 1-5). Second, 
the portion of retrotransposition products that are class B 
is similar between a rad52A strain (Figure 4D) and 
Rad52^ strain (Figure 6 A, wt). Third, we do not see 
clones that contain mixtures of class A/class B events or 
class B-I/class B-II events. We also have never witnessed 
the conversion of class A to class B products, or conver- 
sion of class B-I to the smaller class B-II products. 
This indicates that these retrotransposition products 
are independently generated and not derived from each 
other. 

We do not currently know whether class B products can 
be formed by other LI clade members. The majority of 
investigation on the LI clade has focused on human and 
mouse LI, and thus far circular retrotransposition 
products have not been observed for these elements. 
However, the current assay for mammalian retrotran- 
sposition is not designed to detect circular retrotran- 
sposition products. Generally, unless a plasmid is 
specifically engineered to be maintained as an episome, 
clones derived from mammahan plasmid transfection 
and selection are the result of plasmid integration into 



the chromosome. Thus, if class B products are formed 
during mammahan retrotransposition assays, they are 
likely lost or integrated into the chromosome under the 
current assay schemes. We detected class B products in 
yeast owing to the fortuitous abihty for ZorroS class B 
products to be maintained in yeast under selection. This 
suggests that some subset of Zorro3 sequence can act as an 
autonomous replicating sequence in yeast. The abihty for 
foreign sequence to replicate in yeast is not unprecedented 
(73), and we view the Zorro3 autonomous replicating se- 
quence activity as serendipity. Even if other LI clade 
members form class B products in their native hosts, we 
do not necessarily expect these products to contain an 
origin of replication. 

Although we would expect any particular non-LTR 
retrotransposition-derived circle to exist only transiently, 
class B products may not necessarily be 'dead ends', as 
some LINEs (notably human and mouse LI) encode a 
promoter on their mRNA. Thus, if a mammahan 
fuh-length class B product is formed in a ceh, this circle 
could in principle be a source for future retrotransposition 
events. As the circles would be newly synthesized DNA 
divorced from the chromosomal context, they may escape 
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silencing and could be even more retrotranspositionally 
active (on a per copy basis) than chromosomal copies. 
This would be an example of how a transposon, in this 
case, a LINE element, could circumvent apparent 'death 
by excision', to potentially jump again. Further studies are 
needed to assess whether class B products can be formed 
in other organisms, especially in evolutionarily relevant 
cellular niches (e.g. the germ line of mammals). 

SUPPLEMENTARY DATA 
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